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1 Introduction 

A principal goal of variational analysis is the search for generalized critical points of nons- 
mooth functions /: R" — )• R. For example, given a locally Lipschitz function /, we might be 
interested in points x 6 R" having zero in the "Clarke generalized gradient" (or "subdiffer- 
ential") dcf(x), a set consisting of convex combinations of limits of gradients of / at points 
near jc 1141 . 

Adding a linear perturbation, we might seek critical points of the function x t-^ f(x) — 
v^x for a given vector v 6 R'", or, phrased in terms of the graph of the subdifferential map- 
ping dcf, solutions to the inclusion 

(x,v) 6 gphdj. 

More generally, given a smooth function G: R'" — > R" , we might be interested in solutions 
{x,y) e R'" X R" to the system 

{G{x),y) e gphdj and VGW*y = v (1) 
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(where * denotes the adjoint). Such systems arise naturally when we seek critical points of 
the composite function x i-^ f {G{x)) — v^x. 

Generalized critical points of smooth functions / are, of course, simply the critical points 
in the classical sense. However, the more general theory is particularly interesting to opti- 
mization specialists, because critical points of continuous convex functions are just minimiz- 
ers 031 Proposition 8.12], and more generally, for a broader class of functions (for instance, 
those that are Clarke regular 1 14J), a point is critical exactly when the directional derivative 
is nonnegative in every direction. 

The system (O could, in principle, be uninformative if the graph gphdcf is large. In 
particular, if the dimension (appropriately defined) of the graph is larger than «, then we 
could not typically expect the system to be a very definitive tool, since it involves m + n 
variables constrained by only m linear equations and the inclusion. Such examples are not 
hard to construct: indeed, there exists a function /: R — > R with Lipschitz constant one 
and with the property that its Clarke subdifferential is the interval [—1,1] at every point 
021 . Alarmingly, in a precise mathematical sense, this property is actually typical for such 
functions 1 10|. 

Optimization theorists often consider subdifferentials that are smaller than Clarke's, the 
"limiting" subdifferential df being a popular choice 133 , 15 28, 1 1 1. However, the Clarke 
subdifferential can be easier to approximate numerically (see 112|), and in any case the 
potential difficulty posed by functions with large subdifferential graphs persists with the 
limiting subdifferential |7 |. 

Notwithstanding this pathology, concrete functions /: R" R encountered in practice 
have subdifferentials d^f whose graphs are, in some sense, small and this property can be 
useful, practically. For instance, Robinson fTP\ considers algorithmic aspects of functions 
whose subdifferential graphs are everywhere locally Lipschitz homeomorphic to an open 
subset of R". As above, dimensional considerations suggest reassuringly that this property 
should help the definitive power of critical point systems like (|T]|, and Robinson furthermore 
argues that it carries powerful computational promise. An example of the applicability of 
Robinson's techniques is provided by Minty's theorem, which states that the graph of the 
subdifferential of a proper, lower semicontinuous, convex function /: R" — s- R is Lipschitz 
homeomorphic to R" [271. 

When can we be confident that a function has a subdifferential graph that is, by some 
definition, small? The study of classes of functions that are favorable for subdifferential 
analysis, in particular excluding the pathological examples above, is well-developed. The 
usual starting point is a unification of smooth and convex analysis, arriving at such proper- 
ties as amenability 031 Chapter 10. R], prox-regularity |30|, and cone-reducibility fSl Sec- 
tion 3.4.4]. Using Minty's theorem, Poliquin and Rockafellar [30] showed that prox-regular 
functions, in particular, have small subdifferentials in the sense of Robinson. Aiming pre- 
cisely at a class of functions with small subdifferentials (in fact minimal in the class of upper 
semicontinuous mappings with nonempty compact convex images), (8) considers "essential 
strict differentiability". 

In this work we take a different, very concrete approach. We focus on the dimension of 
the subdifferential graph, unlike the abstract minimality results of |8], but we consider the 
class of semi-algebraic functions — those functions whose graphs are semi-algebraic, mean- 
ing composed of finitely-many sets, each defined by finitely-many polynomial inequalities — 
and prove that such functions have small subdifferentials in the sense of dimension: the 
Clarke subdifferential has n-dimensional graph. This result subsumes neither the simple 
case of a smooth function, nor the case of a convex function, neither of which is necessarily 
semi-algebraic. Nonetheless, it has a certain appeal: semi-algebraic functions are common. 
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they serve as an excellent model for "concrete" functions in variational analysis f22ll . and 
in marked contrast with many other classes of favorable functions, such as amenable func- 
tions, they may not even be Clarke regular Furthermore, semi-algebraic functions are easy 
to recognize (as a consequence of the Tarski-Seidenberg theorem on preservation of semi- 
algebraicity under projection). For instance, observe that the spectral radius function on n x « 
matrices is neither Lipschitz nor convex, but it is easy to see that it is semi-algebraic. 

To illustrate our results, consider the critical points of the function x i— > f{x) — v^x for 
a semi-algebraic function /:R" [— °°, +°°\. As a consequence of the subdifferential graph 
being small, we show that for a generic choice of the vector v, the number of critical points is 
finite. More precisely, there exists a number A^, and a semi-algebraic set 5 C R" of dimension 
strictly less than n, such that for all vectors v outside S, there exist at most A' critical points. 
A result of a similar flavor can be found in 1231 , where criticality of so called "constraint 
systems" is considered. Specifically, f231 shows that if a semi-algebraic constrained mini- 
mization problem is "normal", then it has only finitely many critical points. Furthermore, it 
is shown that normality is a generic property. To contrast their approach to ours, we should 
note that |23 1 focuses on perturbations to the constraint structure, whereas we address linear 
perturbations to the function itself. 

To be concrete, we state our results for semi-algebraic functions. Analogous results, 
with essentially identical proofs, hold for functions definable in an "o-minimal structure" 
and, more generally, for "tame" functions. (In the case of tame functions, "finiteness" of 
critical points should be replaced by "local isolation" in Proposition 4.3 and Corollaries 
4.4. 15.81 [S!9l ) In particular, our results hold for globally subanalytic functions, discussed in 
061 . For a quick introduction to these concepts in an optimization context, see |22|. 

2 Preliminaries 

2. 1 Variational Analysis 

In this section, we summarize some of the fundamental tools used in variational analysis 
and nonsmooth optimization. We refer the reader to the monographs of Rockafellar-Wets 
03] , Borwein-Zhu |11|, Mordukhovich 128 ,29], and Clarke-Ledyaev-Stern-Wolenski llTSl , 
for more details. Unless otherwise stated, we follow the terminology and notation of 1331 . 

Consider the extended real line R := R U {— °°} U {+°°}. We say that an extended-real- 
valued function is proper if it is never {— °°} and is not always {+°°}. 

For a function /: R" — >■ R, we define the domain of / to be 

dom/ := {x e R" : /(x) < -F°o}, 

and we define the epigraph of / to be 

epi/:={(.v,r)eR"xR:r>/W}. 

A set-valued mapping F from R" to R'", denoted hy F:W ^ R™, is a mapping from R" 
to the power set of R"'. Thus for each point x 6 R", F{x) is a subset of R™. For a set- valued 
mapping F: R" ^ R'", we define the domain of F to be 

domf := [xeW ■.F{x)^ib}, 

and we define the graph of F to be 

gphF := {(x,j.) e R" X R'" : y 6 
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Definition 2.1 Consider a set-valued mapping F: R" =| R™. 

1 . F is outer semicontinuous at a point x e R" if for any sequence of points Xr £ R" converg- 
ing to X and any sequence of points yr 6 F{xr) converging to y, we must have y e F{x). 

2. F is inner semicontinuous at X if for any sequence of points Xr G R" converging to x and 
any point j G there exists a sequence yr G R"* converging to y such that yr G F{xr) 
for all r. 

If both properties hold, then we say that F is continuous at x. 

Definition 2.2 Consider a set 5 C R" and a point i G 5. The regular normal cone to 5 at x, 
denoted Ns[x), consists of all vectors v G R" such that 

{v,x — x) < o{\x — x\) for X G S, 

where we denote by o(|jc — i|) for x G 5 a term with the property that 

o{\x-x\) 
\x — x\ 

when X A X with x^x. 

Given a closed set S, the mapping x ^ Ns{x) does not necessarily have a closed graph. 
To correct for that, the following definition is introduced. 

Definition 2.3 Consider a set 5 C R" and a point x e S. The limiting normal cone to S at x, 
denoted Ns{x), consists of all v G R" such that there are sequences Xr^x and — >• v with 

VreNsiXr)- 

For a set S C R", we denote its topological closure by clS and its convex huU by convS. 

Definition 2.4 Consider a set 5 C R" and a point xeS. The Clarke normal cone to S at x, 
denoted Ng{x), is defined by 

Ng{x) = cl conv Ns{x). 
We summarize some simple facts about normal cones that we will need. 
Theorem 2.5 Consider a set 5 C R" and a point x G 5. 

1. Nsix) cNsix) G Nlix). 

2. Ns{x), Ns{x), andNl{x) are closed cones. Ns{x) andNg{x) are, in addition, convex. 

3. For aset F C R" containing x such that S C F, we have Nf{x) C Ns{x). 

Definition 2.6 (Clarke regularity of sets) A set 5 C R" is said to be Clarke regular at a 
point X G 5 if it is locally closed at x and every limiting normal vector to 5 at x is a regular 
normal vector, that is Ns{x) = Ns{x). 

Given any set 5 C R" and a mapping /: 5 — > 5, where S C R", we say that / is smooth 
if for each point x G 5, there is a neighborhood U of x and a C* mapping /: R" — > R™ that 
agrees with f oa SHU. If a smooth function / is bijective and its inverse is also smooth, 
then we say that / is a dijfeomorphism. 

What we call smooth is usually referred to as C' smooth. Since in this work we will not 
need higher order of smoothness, no ambiguity should arise. 
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Definition 2.7 (|25 , Proposition 8.12]) Consider a set M c R". We say that M is a manifold 
(or "embedded submanifold") of dimension r if for eacli point x E M, Ihere is an open 
neigliborhood U around x sucli tliat MDU = F^' (0), where F:U ^ is a smooth map 
with VF(x) of full rank. In this case, we call F a local defining function for M around x. 

Theorem 2.8 (133., Example 6.8]) IfM is a manifold, then for every point x e M, the man- 
ifold M is Clarke regular at x and Nm{x) is equal to the normal space to M at x, in the sense 
of differential geometry. 

Normal cones allow us to study geometric objects. We now define subdifferentials, 
which allow us to analyze behavior of functions. 

Definition 2.9 Consider a function /: R" — > R and a point x 6 R" where / is finite. The 

regular, limiting, and Clarke subdifferentials of / at x, respectively, are defined by 

df{x) = {v e R" : (v, -1) e 4pi/(^,/(^))}, 
df{x) = {ve R" : (v, -1) e N,^,f{x,f{x))}, 
dj{x) = {v e R" : (v, -1) 6 K^,f{x,f{x))]. 

For X such that f{x) is not finite, we follow the convention that df{x) = df{x) = dcf{x) = 0. 

Definition 2.10 (Subdifferential regularity) A function /:R" R is called subdifferen- 
tially regular at x if /(x) is finite and epi/ is Clarke regular at as a subset of 

R" xR. 

Theorem 2.11 (fST, Exercise 8.8, Corollary 10.9]) Consider the function h = f+g, where 
f: R" R is finite at x and g: R" — > R is smooth on a neighborhood ofx. Then we have 

dh{x) = df{x)+Vg{x), dh{x) = df{x)+Vg(x). 

Furthermore, h is subdifferentially regular at x if and only if f is subdifferentially regular at 

X. 

For a set 5 C R", we define 5s'. R" ^ R to be a function that is on 5 and +o° elsewhere. 
We call 5s the indicator function of the set S. 

Theorem 2.12 ( 1331 Exercise 8.14]) Consider the indicator function 5$ of a set S <Z R". 

Then we have 

d5s(x)=Ns{x), d5s{x)=Ns{x). 
Furthermore, 5s is subdifferentially regular at x if and only if S is Clarke regular at x. 

2.2 Semi-algebraic Geometry 

A semi-algebraic set 5 C R" is a finite union of sets of the form 

{x e R" : P{x) = 0, gi (x) < 0, . . . , Qi {x) < 0}, 

where P,Qi, . ■ . ,Qi are polynomials in n variables. In other words, 5 is a union of finitely 
many sets, each defined by finitely many polynomial equalities and inequalities. A map 
F:R" ^ R'" is semi-algebraic if gphf" C R"+'" is a semi-algebraic set. Semi-algebraic sets 
enjoy many nice structural properties. We discuss some of these properties in this section. 
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See the monographs of Basu-PoUack-Roy 1 1 1, Lou van den Dries 071 . and Shiota lf36l . For 
a quick survey, see the article of van den Dries-Miller ISSJ and the surveys of Coste I17II16I . 
Unless otherwise stated, we follow the notation of | 38l and 1171 . 

A fundamental fact about semi-algebraic sets is provided by the Tarski-Seidenberg The- 
orem I n, Theorem 2.3]. It states that the image of any semi-algebraic set S C R", under a 
projection to any linear subspace of R", is a semi-algebraic set. From this result, it follows 
that a great many constructions preserve semi-algebraicity. In particular, for a semi-algebraic 
function /: R" — > R, it is easy to see that the set- valued mappings df, df, and dcf are semi- 
algebraic. See for example |22 Proposition 3.1]. 

The most striking and useful fact about semi-algebraic sets is that they can be parti- 
tioned into finitely many semi-algebraic manifolds that fit together in a regular pattern. The 
particular stratification that we are interested in is defined below. 

Definition 2.13 Consider a semi-algebraic set Q in R". A Whitney stratification of Q is 
a finite partition of Q into semi-algebraic manifolds Mj (called strata) with the following 
properties: 

1. For distinct / and if M; n c\Mj ^ 0, then M, C clM^ \ M,-. 

2. For any sequence of points (xt) in a stratum Mj converging to a point in a stratum M,, 
if the corresponding normal vectors £ Nm, (^i ) converge to a vector y, then y 6 Nm, {x) . 

Observe that property 1 of Definition 12.131 gives us topological information on how 
the strata fit together, while property 2 gives us control over how sharply the strata fit to- 
gether. Property 1 is called the frontier condition and property 2 is called Whitney condition 
(a). We should note that Whitney stratification, as defined above, is normally referred to as 
-Whitney stratification. Furthermore, Whitney condition (a) is usually stated somewhat 
differently. The equivalence is noted in |21 1. One simple example of this type of a stratifi- 
cation to keep in mind throughout the discussion is the partition of a polytope into its open 
faces. 

Definition 2.14 Given finite collections {5,} and {Cj} of subsets of R", we say that {S,} is 
compatible with {Cj} if for all 5, and Cj, either S, fl = or 5, C Cj. 

As discussed above, the following theorem is true. 

Theorem 2.15 (|38, Theorem 4.8]) Let Q,C\,. . . ,Ci be semi-algebraic sets in R". Then Q 
admits a Whitney stratification that is compatible with C\,.. .,Ci. 

The notion of a stratification being compatible with some predefined sets might not 
look natural; in fact, it is crucial since this property enables us to construct refinements of 
stratifications 

We will have occasion to use the following result. 

Theorem 2.16 (138 , Theorem 4.8]) Consider a semi-algebraic set S in R" and a semi- 
algebraic map f'.S^ R'". Let be a finite collection of semi-algebraic subsets of S and 
a finite collection of semi-algebraic subsets ofR"'. Then there exists a Whitney stratifi- 
cation si' of S that is compatible with si and a Whitney stratification ofR'" compatible 
with ^ such that for every stratum Q £ s/', we have that the restriction /|q is smooth and 

f(Q) e 

In particular, it follows that semi-algebraic maps are "generically" (in a sense about to 
be made clear) smooth. 
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Definition 2.17 Let A C R" be a nonempty semi-algebraic set. Then we define the dimen- 
sion of A, dimA, to be the maximal dimension of a stratum in any Whitney stratification of 
A. We adopt the convention that dim0 = — oo. 

It can be easily shown that the dimension does not depend on the particular stratification. 
See 071 Chapter 4] for more details. 

Theorem 2.18 Let A and B be nonempty semi-algebraic sets in R". Then the following hold. 

1. If A C B, then dimA < dimS. 

2. dimA = dimclA. 

3. dim(clA \A) < dimA. 

4. Iff'.A — > R" is a semi-algebraic mapping, then dim/(A) < dimA. Iff is one-to-one, then 
dim /(A) = dimA. In particular, semi-algebraic homeomorphisms preserve dimension. 

5. dimA U 5 = max{dimA, dimS}. 

6. dimA X 5 = dimA + dim S. 

We will need the following simple proposition. 

Proposition 2.19 Consider a Whitney stratification {M, } of a semi-algebraic set Q C R". 
Let Mj be a stratum of maximal dimension. Then for any point x £ Mj, there exists a neigh- 
borhood B cK" around x so that 

BnQ = BnMj. 

Proof Assume otherwise. Then there is a sequence x^ € Q converging to .x with x,- ^ Mj. 
Since there are finitely many strata, we can assume that the whole sequence is contained 
in some stratum M. It follows that .x is a limit point of M. By the frontier condition of the 
Whitney stratification, it must be that dimM, < dimM, which is a contradiction since the 
stratum Mj was chosen to have maximal dimension. □ 
A set J/ C R" is said to be "generic", if it is large in some precise mathematical sense, 
depending on context. Two popular choices are that of U being a. full-measure set, mean- 
ing its complement has Lebesgue measure zero, and that of U being topologically generic, 
meaning it contains a countable intersection of dense open sets. In general, these notions are 
very different. However for semi-algebraic sets, the situation simplifies drastically. Indeed, 
if C R" is a semi-algebraic set, then the following are equivalent. 

• U is full-measure. 

• {/ is topologically generic. 

• The dimension of If^ is strictly smaller than n. 

We will say that a certain property holds for a generic vector v e R" if the set of vectors 
for which this property holds is generic in the sense just described. Generic properties of 
semi-algebraic optimization problems will be discussed in Section]?] 

Definition 2.20 Let A C R'" be a semi-algebraic set. A continuous semi-algebraic map- 
ping p:A — > R" is semi-algebraically trivial over a semi-algebraic set C C R" if there is a 
semi-algebraic set F and a semi-algebraic homeomorphism h: (C) C x F such that 
p\p-'{C) — proj °h, or in other words the following diagram commutes: 

p-'(C) — !^ CxF 

jproj 

c 

We call h a semi-algebraic trivialization of p over C. 
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Henceforth, we use the symbol = to indicate that two semi-algebraic sets are semi- 
algebraically homeomorphic. 

Remark 2.21 If p is trivial over some semi-algebraic set C, then we can decompose p\p-i{c) 
into a homeomorphism followed by a simple projection. Also, since the homeomorphism h 
in the definition is surjective and p\p-i(C) = proj oh, it follows that h(p~^(c)) = {c} x F for 
any c 6 C. Thus for any point c 6 C, we have (c) = F and (C) = C x (c). 

The following is a simple example of semi-algebraic triviality. 

Example 2.22 We follow the notation of Definition l2.20l Consider the semi-algebraic func- 
tion p: R — >■ R defined by p{x) = x^. Now consider the semi-algebraic mapping 

/!:R\{0} -^R++ X {±1}, x^(x^sgnx). 

It is easy to check that h is a semi-algebraic homeomorphism, and furthermore we have 
p = proj oh when restricted to R\ {0}. Thus h is a semi-algebraic trivialization of p over 

R++ 

Definition 2.23 In the notation of Definition 12.201 a trivialization h is compatible with a 
semi-algebraic set 5 C A if there is a semi-algebraic set H (Z F such that h{B n (C)) = 
CxH. 

If h is a trivialization over C then, certainly, for any set S C A we know h restricts to a 
homeomorphism from B n (C) to h{B D (C)). The content of the definition above is 
that if p is compatible with B, then h restricts to a homeomorphism between 5 n /7~ ' (C) and 
C X H for some semi-algebraic set H (Z F. Here is a simple example. 

Example 2.24 Let the semi-algebraic functions p and h be as defined in Example 12.221 
Now notice that /2(R++ fl /7~'(R++)) = R++ x {+!}. Thus h is compatible with R++. 

The following result will be used extensively in the rest of this work. See |37 , Chapter 
9, Theorem 1.2] for more details. 

Theorem 2.25 (Hardt triviality) Let A C R" be a semi-algebraic set and p:A — > R'", a 
continuous semi-algebraic mapping. There is a finite partition of the image p{A) into semi- 
algebraic sets C\,. . . ,Ck such that p is semi-algebraically trivial over each Cj. Moreover, if 
Qi , . . . , 2/ are semi-algebraic subsets of A, we can require each trivialization hi'. p^^iC,) — > 
C; X Fj to be compatible with all Qj. 

Example 2.26 Consider the following elaboration on Example l2.22l Let the semi-algebraic 
functions p and h be defined as in Example l2.22l We saw that /i is a semi-algebraic trivializa- 
tion of p over R++. Let /: {0} — > {0} x {0} be the zero map. Observe / is a semi-algebraic 
trivialization of p over {0}. Thus {R++, {0}} is a partition of p(R) guaranteed to exist by 
Theorem l2.25l 

Given a continuous semi-algebraic function p, Theorem l2.25l states that we can partition 
the image of p into semi-algebraic sets Ci, . . . so that for each index i = I,. . . ,k, the 
restricted mapping plp HQ) has a very simple form. By applying Theorem 12.251 to various 
naturally occurring mappings, many interesting results can be obtained. See f37! Chapter 9] 
for more details. In particular, by applying this theorem to the projection map we can break 
up semi-algebraic sets into simple building blocks that have product structure and analyze 
each one separately. This type of reasoning leads to the following corollary. 
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Corollary 2.27 Let F:K" ^ R'" be a semi-algebraic set-valued mapping. Then there exists 
a partition of the domain of F into semi-algebraic sets Xi,X2, ■ ■ ■ ,Xk with the following 
properties: 

1. For each index i = 1,2, ...k, there exists a semi-algebraic set Yj C R'" and a semi- 
algebraic homeomorphism 0i:gphF|x, — x Yi satisfying 

xF(x) ) = {x} X Yi for all xe X,-. 

Consequently, for all x 6 Xi, we have F{x) = Yi and 

g^hF\x,~XiXF{x). 

2. If in addition, F:R" ^ R™ is another semi-algebraic set-valued mapping with F{x) C 
F(x), then we may also require that for each index i = 1,2,. . . ,k, there exists a semi- 
algebraic set Yi C Yj, such that 9,(gphF|x,) = Xi x Yi. Consequently, for all x 6 Xi, we 
have Fix) = Yi and 

g^hF\x,?^XiXF[x). 

Proof Assume that we are given semi-algebraic set- valued maps F and F such that F{x) C 
F{x) for all x e R". If F was not given, proceed with the proof with = for all £ R". 
Consider gphF C R" x R™. Let p: gphF — > R" be the projection onto the first n coordinates. 
By applying Theorem l2.25l to p, we get a partition of the domain of F into semi-algebraic 
sets Xi,X2,...,Xk such that p is semi-algebraically trivial over each X, and each trivializa- 
tion is compatible with gph F . Thus there exist semi-algebraic sets Yi,Y2,...,Yk C R'" and 
Y\,Y2,...,Yi^ C R'" with Yi <zYi, such that for each /, there is a semi-algebraic homeomor- 
phism di'. p~^(Xi) — ^ Xi X Yi, where we have 

diigphFnp-'iXi)) =XixYi, 

projx,o0; = (2) 

Observe that p~^{Xi) = gphf jx, and since gphF is contained in gphF, it follows that 
gphF n p~^{Xi) = gphF\xi. Thus to summarize, we have 

gphFjx,=XiXY^, (3) 
gphF\x,~XiXYi. 

Finally, from (|2]i and it follows that for all points x E Xi, we have 

ei{{x}xF{x)) = {x}xYi, 

completing the proof. □ 
The following proposition appears in |T,T|; as observed there, this result is an easy and 
important consequence of Theorem l2.25l and even though we will not have occasion to use 
it in this work, we include it and its proof below as an elegant illustration. 

Proposition 2.28 Let F:R" ^ R"' be a semi-algebraic set-valued mapping. Then there ex- 
ists a finite partition of the domain of F into semi-algebraic sets X\, ... ,Xk, such that for 
each index i = I,. . . ,k, the restricted mapping F\x, is inner semicontinuous. If in addition, 
the mapping F is compact-valued, then we can also require the restricted mapping F\x; to 
be outer semicontinuous for each index i= I,. . .,k. (In fact, the partition guaranteed to exist 
by Corollarv \2.2 71 one such partition.) 
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Proof Applying Corollary 12.271 to the mapping F, we get a finite partition of the domain 
of F into semi-algebraic sets Xi,...,Xk, so that, in particular, property 1 of the corollary 
holds. To see the inner semicontinuity of the restricted map F\x,, consider any point £ 
gphFjx;, and any sequence of points x,- ^ in the setX,. We want to construct a sequence 
of points y,- e F(jc,-) converging to y. Notice that 0,(jc,y) = (x,y) for some point y £ Yj. Since 
{Xr,y) {x,y), we deduce d~^{xr,y) — ^ d~^{x,y) = {x,y). But for each index r, we know 
d~^{xr,y) = {xr,yr) for some point y^ £ F{x,-), so the result follows. 

Assume now that F is compact-valued. Consider any point x 6 X, and any sequence of 
points {xr,yi-) — > {x,y), where y is some point in R'" and y,- £ F{xr) for each r. We want 
to argue that y is in F{x). Consider the sequence (x,projy.(0,(jc,.,yr)))- Observe that this 
sequence is contained in {x} x Yj, which is a compact set since it is homeomorphic to F{x). 
Thus, without loss of generality, we can assume that (i;,projy.(0i(x,-,3',-))) converges to {x,y) 
for some point y £ Yj. So we have 

{x„yr) = er\x„pmiy^{ei{x,.,y,-))) ^ er\x,y) £ {x}xF{x). 

By the uniqueness of the limit, we must have y E F{x). □ 
As a consequence of Proposition l2.28l it follows that any semi-algebraic set- valued map- 
ping F:R" — )• R™ is generically inner semicontinuous. If, in addition, F is compact- valued, 
then F is generically continuous. In fact, we can do better. If we require the mapping F just 
to be closed- valued, then we can still partition its domain into semi-algebraic sets Xi,. . . ,Xk, 
such that for each index i= 1, . . . the restricted mapping is continuous. To see this, 
we need the following theorem that appears in f33', Theorem 5.55], and is attributed to 1131 
|24 , 35 1 . Recall that given a topological space X, a subset A of X is meager if it is a union of 
countably many nowhere dense subsets of X. 

Theorem 2.29 (Kuratowski) Consider a set X C R" and a closed-valued set-valued map- 
ping F:X ^ R™. Assume that F is either outer semicontinuous or inner semicontinuous 
relative to X. Then the set of points where F fails to be continuous relative to X is meager 
in X. 

It is easy to see that if a semi-algebraic set S is meager in another semi-algebraic set X, then 
the dimension of S is strictly less than the dimension of X (see |4] for more details). 

Proposition 2.30 Let F:R" ^ R™ be a semi-algebraic closed-valued set-valued mapping. 
Then there exists a finite partition of the domain of F into semi-algebraic sets Xi,. .. ,Xk, 
such that for each index i = I,. . . ,k, the restricted mapping F |x, is continuous. 

Proof Applying Proposition 12.281 to the mapping F, we get a partition of the domain of F 
into semi-algebraic sets Xi,. . . ,Xt, so that the restricted map F\x, is inner semicontinuous. 
Fix some set X,-. Let 5° := Xj and let 5' C X, be the set of points at which F\gi fails to 
be continuous. By Theorem 12.291 it follows that dim^' < dimS". Now by applying this 
argument inductively, we can create a sequence of semi-algebraic sets 5" 3 ... D 5*, for 
some integer k, such that the collection {Sj\ Sj+ \ } y^,' is a partition of X; and F is continuous 
when restricted to each Sj\Sj+i . By applying this argument to all the setsX,-, for / = I,. ..,k, 
we get the result. □ 

Remark 2.31 In fact, it is shown in Daniilidis-Pang 1181 that closed-valued semi-algebraic 
maps are generically strictly continuous (see [331 for the definition). Their proof of this 
rather stronger result requires more sophisticated tools. 

Finally, we have the following result: 
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Theorem 2.32 (|38, Theorem 4.4]) Let A be a semi-algebraic subset o/R" x R'". There is 
an integer jS such that for every point x 6 R", the number of connected components of the 
set Ax = {y 6 R'" : {x,y) £ A} is no greater than jS. 

The following is a simple special case of Theorem 12.321 We record it here for conve- 
nience. 

Remark 2.33 Let F:R" ^ R™ be a semi-algebraic mapping. Applying Theorem 12.321 to 
gphF C R" X R", we deduce that there is an integer P such that for every x e R", the number 
of connected components of Fix) is no greater than p. 

3 Main Results 

Definition 3.1 Consider a Whitney stratification jz/ of a semi-algebraic set Q C R". We 
define the normal bundle Nj;^ associated with the stratification si^ to be the union of the 
normal bundles of each stratum, that is 

N,^= U gph^M= U {(-^,>')eR"xR":^eM,y6AfMW}. 

In the definition above, since there are finitely many strata and for each stratum M £ , 
the semi-algebraic set gphA^^ is «-dimensional, we deduce that the normal bundle N^j is a 
semi-algebraic set of dimension n. 

Proposition 3.2 Consider a semi-algebraic set Q C R" and suppose it admits a Whitney 
stratification = {M, }. Then for any stratum Mj and any point x £ Mj, the Clarke normal 
cone, Nq{x), is contained in the normal space, Nm; (x). Consequently, the inclusion gph A'g C 
holds and so the graph of the Clarke normal cone has dimension no greater than n. 

Proof Observe that for any stratum Mj, we have the inclusion Mj C Q. Hence for any point 
X E Mj, the inclusion 

Nq{x)cNmj{x)=Nm,{x) (4) 

holds. Now fix some stratum M, and a point x £ Mj. We claim that the limiting normal cone 
Nq{x) is contained in Nm,{x). To see this, consider a vector v £ Nq{x). By definition of 

the limiting normal cone, there exist sequences (x,-) and (v,) such that x,- x and v,. — > v 
with Vr £ Nq{xi-). Since there are finitely many strata, we can assume that there is some 
stratum Mj such that the entire sequence (x,-) is contained in Mj. From we deduce 
A'e(^r) C NmXx,.), and hence Vy £ Nm {x,-). Therefore by Whitney condition (a), we have 
V £ Nm,{x). Since v was arbitrarily chosen from Ng{x), we deduce Nq{x) C Nm,{x) and thus 
Nq{x) = cl conv Nq{x) C Nm, {x), as we needed to show. □ 
Shortly, we will generalize this result to the graph of the Clarke subdifferential. We need 
the following simple result. We provide a proof for completeness. 

Proposition 3.3 Let A (Z^" be a semi-algebraic set and p:A ^ R™, a continuous semi- 
algebraic mapping. Let D be the image set of the mapping p. Then we have the inequality, 

dimD + mindimp~'(x) < dimA < dimD + maxdimp~'(;c). 

In particular, if there exists an integer k such that the set p~^{x) is k-dimensional for every 
point X £ D, then the equality, 

dimA = dimD + k, 

holds. 
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Proof Applying Theorem l2.25l to the mapping p, we obtain a finite partition of the set D 
into semi-algebraic sets {C,} such that p~^{Ci) = C; x p~'(c), for any c 6 C,-. Let C,- be a 
partitioning set satisfying dimA = dim (C,), and let c be any point in C,. Then we have 

dimA =dim/7~'(C,) = dimC, -|-dim/7~'(c) < dimZ)-|-maxdim/7~'(x). 

Let Cj be a partitioning set satisfying dimD = dimCj, and let c be any point in Cj. Then we 
obtain 

dimA > dim;7^'(C;) = dimC; -|-dim/7^'(c) > dimZ)-|-mindimp~'(jc), 

as we needed to show. □ 
We record the following simple and intuitive corollary for reference. 

Corollary 3.4 Let F:ti" ^ R'" be a semi-algebraic set-valued mapping and let D := domF. 
Then we have the inequality, 

dimD + mindimFfx) < dimsphF < dimD + maxdimF(x). 

x€D x€D 

In particular, if there exists an integer k such that the set F (x) is k-dimensional for every 
point X 6 D, then the equality, 

dim gph F = dim D-\-k, 

holds. 

Proof This is an easy consequence of Corollarv l2.27l □ 

Theorem 3.5 Let f:R"^Kbea semi-algebraic function. Then the graph of the Clarke 
subdifferential, gph dcf, has dimension no greater than n. 

Proof Let F : = epi / and 

A := {{x,r,y) £ R" x R x : {(,x,r),y) e gphiV^, r = f(x),y„+, < 0}. 
Using Proposition l3.2l we see 

dim A < dim gph A'^ < « + 1 . (5) 
Consider the continuous semi-algebraic map 

(SiA^R" xR" 



{x,f{x),y)^ (x,7i(--^)), 
^ \yn+i I ^ 



where 7i:R"~^^ — ^ R" is the canonical projection onto the first n coordinates. Observe that 
the image of (j) is exactly the graph of the Clarke subdifferential dcf. Furthermore, for any 
pair (x, v) £ gph dcf, we have 

(^-i(x,v) = {x}x{/(x)}xR+(v,-l), 

and hence dim(^^' (c) = 1 for any point c in the image of (j). By Proposition [33J we deduce 

dim gph dcf -f 1 = dimA <n-\-l, 
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where the last inequality follows from (O. Hence, we obtain dimgph(9c/ < n, as we needed 
to show. □ 
Shortly we will show that for a proper semi-algebraic function /: R" — > R, both gph dcf 
and gph (9/ have dimension exactly equal to n. In the case that the domain of / is full- 
dimensional, this fact is easy to show. The argument is as follows. By Theorem 12. 161 the 
domain of / can be partitioned into semi-algebraic manifolds {X,} such that f\x, is smooth. 
Let Xi be the manifold of maximal dimension. Observe that for x £ Xj, we have df{x) = 
\y f{x)] and it easily follows that dimgph(9/|x, = n. Thus we have 

n < dim gph (9/ < dim gph (9c/ < w, 



where the last inequality follows from Theorem 1 3. 5 1 and hence there is equality throughout. 
The argument just presented no longer works when the domain of / is not full-dimensional. 
A slightly more involved argument is required. We record the following simple observation 
for reference. 

Proposition 3.6 Consider a smooth manifold M C R" and a smooth real-valued function 
f:M^R. Define a function /; : R" — > R agreeing with f on M and equaling plus infinity 
elsewhere. Then h is subdifferentially regular throughout M. Furthermore, at any point x £ 
M, we have 

dh{x)=NM{x) + Vg{x), 

where g: R" R is any smooth function agreeing with f on M on a neighborhood of x. 
Consequently, dh{x) is nonempty with dimension n — dimM. 



Proof Observe that near the point x, we have h = Sm + g- Combining Theorem l2.8l and The- 
orem l2.12l we have that the function 5m is subdifferentially regular at x. By Theorem l2.11l 
it follows that h is subdifferentially regular at x and 

dh{x) = d5M(x)+Vg{x)=NM{x)+Vg{x), 

as we needed to show. □ 

Theorem 3.7 Let f:R"^Rbea proper semi-algebraic function. Then the graphs of the 
regular, limiting, and Clarke subdifferentials have dimension exactly n. 

Proof We know 

dim gph 5/ < dimgph^/ < dimgph^c/ < «, 



where the last inequality follows from Theorem 13.51 Thus if we show that the dimension 
of gph 5/ is no less than n, we will be done. With that aim, applying Theorem 12.161 to 
the function /, we obtain a Whitney stratification {M,} of the domain of / such that for 
every stratum M,, the restriction is smooth. Let Mj be a stratum of dom/ of maximal 
dimension. 

Now consider the function h: R" — >■ R, which agrees with / on Mj and is plus infinity 
elsewhere. By Proposition 12.191 the functions h and / coincide on a neighborhood of x. 
Applying Proposition 13.61 we deduce that / is subdifferentially regular at x and df[x) is 
nonempty with dimension n — dimM,. Since the point x was arbitrarily chosen from Mj, we 
deduce d\mdf{x) = n — dimMy for any point x 6 Mj. Thus applying Corollary 13.41 to the 
semi-algebraic set-valued map df\M, we deduce dim gph df\M = dimM^ + n — dimM,' = n, 
and hence the result follows. □ 

More refined, local versions of Theorem [3j7] are investigated in 1191 . 
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Theorem 13.51 shows that for a semi-algebraic function /: R" — ^ R, the Clarke subdif- 
ferential dcf is small in a dimensional sense. If / is also Lipschitz, it is small in another 
sense, that we now discuss: we relate our results to the notion of a minimal cusco (convex 
upper semicontinuous nonempty compact valued set- valued mapping), introduced in (8]. 
To that effect, consider a set A C R" and a set-valued mapping F:A =t R"'. We say that 
F is upper semicontinuous at some point x 6 A if every open set U containing F(x) also 
contains F{z) for all points z 6 A close to x. If a map is closed- valued and upper semicontin- 
uous, then it is outer semicontinuous. On the other hand, if a map is outer semi-continuous 
and locally bounded, then it is upper semicontinuous. See lilll Section 5.1.4] for more de- 
tails. In particular, for a Lipschitz function /, the Clarke subdifferential d^f is upper semi- 
continuous |14, Proposition 2.1.5]. The mapping F:A =S R'" is said to be a cusco if it is 
upper-semicontinuous on A and F(x) is a nonempty compact convex set for each point .v £ A. 
A minimal cusco is a cusco, whose graph does not strictly contain the graph of any other 
cusco. Let U C R" be an open set and consider a semi-algebraic locally Lipschitz function 
f:U ^R. It follows by a direct application of |8 Corollary 2.2] and generic smoothness of 
/ that the the set-valued mapping dcf is, in fact, a minimal cusco. 

It is tempting to think that in the semi-algebraic setting, the graph of an arbitrary minimal 
cusco should have small dimension. However, it is not hard to see that this is not the case. 
For instance, we will now exhibit a semi-algebraic minimal cusco F:K? =t R^, whose graph 
is 4-dimensional. Thus semi-algebraic minimal cuscos with low dimensional graphs, such 
as the Clarke subdifferential of a semi-algebraic locally Lipschitz function / defined on an 
open set, are somewhat special. 

To simplify notation, we let [y < 0, z < 0] be an alias for the set {{x,y,z) € R^ :y <0,z< 
0} and we reserve analogous notation for relaters '>' and '='. Consider the semi-algebraic 
set- valued mapping FrR-' =^ R^, defined as follows 

^Ilv>o,.>o| = {(0,0,0)}, F|l,,<o,,>o] = {(0,0,1)}, 

^Ilv<o,.<o] = {(0, 1,0)}, F|l,,>o.,<o] = {(1,0,0)}, 

^lLr>o,.=o] =conv{(0, 0,0), (1,0,0)}, F|l,,=o,z>o] = conv{ (0,0, 1), (0,0,0)}, 

^lb<o,z=o] =conv{(0, 1,0), (0,0,1)}, F|l,=o,z<o] = conv{(l,0,0), (0, 1,0)}, 

F I L,,=o,,=o] = conv { (0, 0, 0) , (0, 0, 1 ) , (0, 1 , 0) , ( 1 , 0, 0) } . 

It is easy to verify that F is indeed a minimal cusco with a 4-dimensional graph. In particular. 
Theorem 13.71 implies that F is not the Clarke subdifferential mapping d^f for any semi- 
algebraic function /: tt? ^ R. 

4 Consequences 

Definition 4.1 Consider a function /: R" R. We say that a point x 6 R" is Clarke-critical 
for the function / if £ dcf{x), and we call such a critical point x nondegenerate if the 
stronger property e ndcf{x) holds. 

Recall that for a proper convex function /: R" — > R and a point x £ dom/, the subdiffer- 
entials df{x), df{x), and dcf{x) all coincide and are equal to the convex subdifferential of 
/ at X. So in this case, the notions of Clarke-criticality and Clarke-nondegeneracy reduce to 
more familiar notions from Convex Analysis. The importance of nondegeneracy for the sen- 
sitivity analysis of convex functions is well known: in i26i . for example, it is an underlying 
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assumption for a pioneering conceptual approach to superlinearly convergent convex min- 
imization algorithms. Consider the following largely classical theorem (see |4, Proposition 
1] and 1201). 

Theorem 4.2 Let f:R"^Rbea proper convex function. Consider the collection of per- 
turbed functions hy(x) = f{x) — {v,x), parametrized by vectors v £ R". Then for a full mea- 
sure set of vectors v 6 R", the function hy has at most one minimizer, which furthermore is 
nondegenerate. 

Shortly, we will prove that a natural analogue of Theorem |4j2] holds for arbitrary semi- 
algebraic functions, with no assumption of convexity. We will then reference an example of 
a locally Lipschitz function that is not semi-algebraic, and for which the conclusion of our 
analogous result fails, thus showing that the assumption of semi-algebraicity is not super- 
fluous. In what follows, for a set S, the number of elements in S will be denoted by 5*. We 
begin with the following simple proposition. 

Proposition 4.3 Let F: R" ^ R™ be a semi-algebraic set-valued mapping whose graph has 
dimension no greater than n. Then there exists jS 6 N such that for a generic set of points 
c 6 R", we have F{c)* < jS. 

Proof Let D = domf . If the dimension of D is strictly less than n, then we are done since 
then the complement of D is a set satisfying the claimed property with jS = 0. Thus assume 
that D has dimension n. Applying Corollarv l2.27l to the mapping F, we get a finite partition 
of D into semi-algebraic sets {C,}, such that 

gphFic,. ^C, xF(c) 

for any c 6 C,. Let C, be a partitioning set of maximal dimension. So dimC, = n and we have 

n > dimgphFjc, =«-|-dimF(c) 

for any c 6 C,. Thus dimF(c) = and since F{c) is a semi-algebraic set, it must be finite. 
Since this argument holds for any C, of maximal dimension, we have that for a generic vector 
c, the set F{c) is finite. Observe that if F{c) is a finite non-empty set, then F{c)* is equal 
to the number of connected components of F{c). By Remark [2.331 there exists jS 6 N such 
that for all c 6 R", the number of connected components of F{c) is no greater than jS. So in 
particular, for generic c, we have F{c)* < jS. □ 

Corollary 4.4 Let f: R" — > R fee a semi-algebraic function and consider the collection of 
perturbed functions hy{x) = f{x) — {v,x), parametrized by vectors v € R". Then there exists 
a positive integer jS, such that for generic v 6 R", the number of Clarke-critical points of the 
perturbed function hy is no greater than jS. 

Proof Observe 

e dcK{x) <^ V 6 dcf{x) ^xe {dcf)~\v). 

Thus the set (<3c/)~'(^) equal to the set of Clarke-critical points of the function /i,,. By 
Theorem 1 3. 5 1 we have dimgph(9f./ < n, hence dimgph (<5c/)~' < «• Applying Theorem 14. 3 1 
to (dcf)~^, we deduce that there exists a positive integer jS, such that for generic v, we have 
{{dcf )~\v))* < jS. The result follows. □ 

Corollary 4.5 Let /: R" — > R fee a semi-algebraic function and consider the collection of 
perturbed functions hy{x) = f{x) — (v,x), parametrized by vectors v € R". Then for generic 
V € R", every Clarke-critical point of the function hy is nondegenerate. 
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Corollarv l4.5l follows immediately from the observation 

e ridchyix) ^vE ndcf{x), 

and the following result. 

Corollary 4.6 Let f: R" — > R foe a semi-algebraic function. Then for generic v 6 R", we 
have that 

X e {dj)-\v) =^ve ndj{x). 
Proof Let D = domdcf. Consider the semi-algebraic set- valued mapping 

F:R"=4R", x^ihdj{x). 

Our immediate goal is to show that the dimension of gphf" is no greater than « — 1. Ob- 
serve that for each ;c £ R", we have F{x) C dcf{x). Applying Corollarv l2.27l to the mapping 
dcf, we get a finite partition of D into semi-algebraic sets {X, }, such that 

gphdJ\x:=Xi X dj{x) 

and 

gphFU,^X,-xF(x) 
for any x 6 X, (for each ;'). By Theorem 1 3. 5 1 we have that 

n > dimgph(9c/U, = dimX; + dimdcf{x). 
Since F{x) = rb dcf{x), it follows that 

dimF{x) < dimdcf(x) — 1. 

Therefore 

dimgphFjx, = dimX,- -|-dimF(;c) < dimX; + dimdcf{x) — 1 < n — 1. 

Thus 



<n- 1. 



dimgphF = dim ^UgphF|x, 

i 

And so if we let 

M::gphF^R" 

be the projection onto the last n coordinates, we deduce that dim;r(gphF) < n — 1. Finally, 
observe 

;r(gphF) = |v e R" : v e ThdJ{x) for some jc e R" |, 

and so the result follows. □ 

Remark 4.7 Observe that if a convex function has finitely many minimizers then, in fact, it 
has a unique minimizer. Thus, for a proper convex semi-algebraic function. Corollaries 14.41 
andl4.5lreduce to Theoreml4.2l 



Remark 4.8 In Corollaries 14.41 and 14.51 if the function / is not semi-algebraic, then the 
results of these corollaries can fail. In fact, these results can fail even if the function / is 
locally Lipschitz continuous. For instance, there is a locally Lipschitz function /: R — >^ R 
such that dcf(x) = [—x,x\ for every ;c £ R. See the article of Borwein-Moors-Wang |9|. For 
all V 6 R, the perturbed function /i,, has infinitely many critical points, and for all v e R \ {0}, 
the function hy has critical points that are degenerate. 
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5 Composite Optimality Conditions 

Consider a composite optimization problem min g{F{x)). It is often computationally more 

X 

convenient to replace the criticality condition € d{goF) (x) with the potentially different 
condition 6 VF{xydg{F{x)), related to the former condition by an appropriate chain rule. 
See for example the discussion of Lagrange multipliers in [34|. Thus it is interesting to study 
the graph of the set-valued mapping x ^ VF{x)*dg{F{x)). 

5.1 Dimensional Analysis of the Chain Rule. 

The following is a standard result in subdifferential calculus. 

Theorem 5.1 H33\ Theorem 10.6] Consider a function g:R'" — > R and a smooth mapping 
F: R" ^ R™. Then at any point x 6 R", one has 

d{goF)(x)DVF(x)*dg{F{x)). 

Now assuming that the functions g and F in the theorem above are semi-algebraic, we 
immediately deduce, using Theorem 13.51 that the dimension of the graph of the mapping 
X I— > VF{x)*dg(F(x)) is at most n. One can ask what happens more generally in the case of 
the limiting and Clarke subdifferentials. It is well known that the inclusion 

d{goF){x)DVF{xydg{F{x)) 

is only guaranteed to hold under certain conditions |33, Theorem 10.6]. The Clarke case 
is similar 1141 Theorem 2.3.10]. Hence, a priori, the dimension of the graph of the set- 
valued mapping x i— > VF{x)*dg{F{x)) is unclear. In this section, we will show that if g is 
lower semicontinuous, then this dimension is no greater than n and we will derive some 
consequences. 

The proofs of Proposition [3]2] and Theorem 1 3. 5 1 are self contained and purely geometric. 
There is, however, an alternative approach using |5 Proposition 4], which will be useful 
for us. We state this proposition now. We denote the linear subspace of R" parallel to a 
nonempty convex set S C R" by par 5. 

Proposition 5.2 [5. Proposition 4] Consider a proper, lower semicontinuous, semi-algebraic 
function g:K"' R. Then there exists a Whitney stratification {M,} of the domain of g such 
that for each stratum Mi and for any point x 6 Mj, the inclusion paidc.g{x) C Nm^ {x) holds. 

Before proceeding, we record the following special case of Theorem 15.11 Consider a 
smooth function F:R" — > R'" and a nonempty set Q C R'". Consider any point x 6 R". 
Applying Theorem l5.1l to the indicator function of Q, we deduce 

N^-.fQ^{x)DVF{xyNQ{F(x)). 

If we let Q = F{X), for some set X C R", then we obtain 

Nf^'iF{x)){-^) 3 VF{xyNF(x){F(x)). (6) 

Theorem 5.3 Consider a proper, lower semicontinuous, semi-algebraic function g: R"' — > R 
and a smooth semi-algebraic mapping F:R" — > R™. Then the graph of the semi-algebraic 
set-valued mapping x i— > V F (x)* dcg{F [x)) has dimension no greater than n. 
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Proof Consider the Whitney stratification {M,} of domg that is guaranteed to exist by 
applying Proposition 15.21 to the function g. Now applying Theorem 12.161 to the mapping 
F, we obtain a Whitney stratification {X,} of R" and a Whitney stratification {Kj} of R'" 
compatible with {M,} such that for each index i, we have F{Xi) = Kj for some index j. Fix 
some stratum X and a point x E X. If F{X) is not a subset of the domain of g, then clearly 
VF{-)*dcg{F{-))\x = 0. Hence, we only consider X such that F{X) C domg. Let M be the 
stratum satisfying F{X) C M. Observe by our choice of the stratification {M, }, we have 

VF{xyd,g{F{x)) C VF{xyv + VF{xyNM{F{x)), 

for some vector v £ R'". Hence we have the inclusions 

parVF{xyd,g{F{x)) C VF{xyNM{F{x)) C VF{xyNf(x){F{x)), (7) 

where the last inclusion follows since the manifold F{X) is a subset of M. Combining ^ 
and (|7]i, we obtain 

parVF{xyd,g{F{x)) cNf '(F{x)){x) CNx{x), 

where the last inclusion follows since the manifold X is a subset of F~^ {F{X)). So we 
deduce 

dimV F{xydcg{F{x)) <n-dimX. 

Since the point x was arbitrarily chosen from X, we conclude, using Corollary 13.41 the in- 
equality dim gphV F {-y dcg{F {■y)\x < n. Taking the union over the strata {Xj} yields 

dimgphVF{-yd,g{F{-))<n, 

as we claimed. □ 
Observe that Theorem 15. 3 1 is a generalization of Theorem 13.51 This can easily be seen 
by taking F to be the identity map in Theorem 1 5. 3 1 

Remark 5.4 Consider a proper, lower semicontinuous, semi-algebraic function g: R'" — > R 
and a smooth semi-algebraic mapping F:R" R™ satisfying domgoF = F~'(domg) ^ 0. 
A natural question, in line with Theorem 13.71 is whether the graph of the mapping x i-^ 
VF{xydcg{F{x)) has dimension exactly n. In fact, there is no hope for that to hold generally. 
For instance, it is possible to have dcg{y) = for every point y in the image of F. This 
example, however motivates the following easy proposition. 

Proposition 5.5 Consider a proper, lower semicontinuous, semi-algebraic function g: R™ — > 
R and a smooth semi-algebraic mapping F: R" R™. Assume that the set F~ ' (dom dg) has 
a nonempty interior Then the graph of the set-valued mapping VF{-)*dg{F{-)) has dimen- 
sion exactly n. Analogous results hold in the limiting and Clarke cases. 

Proof We show the theorem only for the case of the regular subdifferential. The proof 
remains unchanged for the other cases. Clearly, as a result of Theorem 15.31 it is sufficient 
to show that the inequality, dimgphVF(-)*5g(f (•)) > n, holds. To that effect, consider an 
open set A' contained in the interior of the set (dom^g). Then for any point x E N, the 
set dg(F{x)) is nonempty. Hence, the the map x ^ VF(x)*5g(F(x)) has nonempty values 
on A'. In particular, letting k: R" x R" — > R" be the projection onto the first n coordinates, 
we see 

7t{gphVFi-ydg{F{-))\N)=N. 

Hence we conclude dimgph VF(-)*5g(F(-)) |a; > dimA^ = n, thus completing the proof. □ 
The following easy result, which we state without proof, gives a different approach. 
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Proposition 5.6 Consider a proper, continuous, semi-algebraic function g:W' — > R and a 
smooth semi-algebraic mapping F:R" R"'. If the constraint qualification of h33\ Theo- 
rem 10.6] holds at some point x (E F"'(domg), then the graph of the set-valued mapping 
V F {■)* d g{F [■)) has dimension exactly n. An analogous statement holds in the Clarke case. 

5.2 Consequences 

Let F:R" R™ be a smooth mapping and g:R'" — > R a proper lower semicontinuous func- 
tion. (For simplicity, here we assume that the mapping F is defined on all of R". However the 
whole section extends immediately to a mapping F defined only on an open subset U C R".) 
Consider the following collection of composite minimization problems, parametrized by 
vectors v 6 R" . 



For a point x to be a minimizer for P(v), the inclusion v 6 d{goF)(x) must necessarily 
hold. As discussed in the beginning of the section, it is often more convenient to replace this 
condition with the potentially different condition v 6 V F {x)* d g{F {x)) . This motivates the 
following definition. 

Definition 5.7 We say that a point x is Clarke critical for the problem (P(v)) if the inclu- 
sion V £ V F {x)* dcg{F {x)) holds, and we call such a critical point x nondegenerate for the 
problem (-P(v)) if the stronger property v £ nVF(x)*dcg{F{x)) holds. 

We are now in position to state a natural generalization of Corollaries l4.4l and l4.5l 

Corollary 5.8 Let F:R" — > R™ be a semi-algebraic smooth function and g:R'" — > R a 
proper lower semicontinuous semi-algebraic function. Consider the following collection of 
optimization problems, parametrized by vectors v £ R". 



Then there exists a positive integer jS, such that for a generic vector v 6 R", the number 
of Clarke-critical points for the problem (-P(v)) is no greater than jS. Furthermore, for a 
generic vector v 6 R", every Clarke-critical point for the problem (-P(v)) is nondegenerate. 

Proof Observe that by Theorem 1 5. 3 1 the graph of the mapping x ^ VF{x)*dcg{F{x)) has 
dimension no greater than n. The proof now proceeds along the same lines as the proofs of 
Corollaries |4j4] and |431 □ 

Observe that Corollaries l4.4l and l4.5l can be considered as special cases of Corollarv l5.8l 
in which the map F is the identity map. 

A noteworthy illustration of Corollarv l5.8l is the problem of constrained minimization, 
which we discuss now. Let /: R" — > R be a semi-algebraic function and D C R" a closed 
semi-algebraic set. Consider the following collection of constrained minimization problems, 
parametrized by vectors v 6 R". 



mm g{F{x)) - {v,x) 



mmg{F{x))-{v,x) 



min f{x) - {v,x) 

s.t. xeD 



Observe that (-P'(v)) is equivalent to the problem mmg{F{.x)) — {v,x), where we define 

x€R" 

F{x) = (x,x) and g{x,y) = f{x) + doiy). 
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Hence, in the sense of composite minimization, it is easy to check that a point jr £ D is 
Clarke critical for the problem (-P'(v)) if v 6 dcf{x) +N^{x), and such a critical point x is 
nondegenerate for the problem (-P'(v)) if the stronger property v 6 nd^f {x) + nN'j^{x) holds. 

Corollary 5.9 Let /: R" — > R be a semi-algebraic function and let D be a closed, semi- 
algebraic set. Consider the following collection of optimization problems, parametrized by 
vectors v £ R". 



Then there exists a positive integer jS, such that for a generic vector v £ R", the number 
of Clarke-critical points for the problem (-P(v)) is no greater than j6. Furthermore, for a 
generic vector v 6 R", every Clarke-critical point for the problem (-P(v)) is nondegenerate. 



Acknowledgements Thanks to Alex D. loffe for suggesting the extension we pursue in Section|5] 
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